14.4 The Cell Cycle

189

Differences Between Prokaryotes and Eukaryotes (3)

The above considerations do not directly address the question of why prokaryotes

have rather compact genomes; they seem to be limited to about 10 million base

pairs (10 Mb) (and many bacteria living practically as symbionts in a highly con-

strained environment manage with far less). In a general sense, one can understand

that prokaryotes are under pressure to keep their genomes as small as practicable;

they are usually replicating rapidly under rr-selection (Sect. 14.9.4) and the need to

copy 1000 million base pairs would be physicochemically incompatible with a short

interval from generation to generation. On the other hand, most of the cells in a

metazoan are not replicating at all, and the burden of copying enormous genomes

during development is perhaps compensated for by the availability of plenty of raw

material for exploratory intraorganismal gene development (which the prokaryotes

do not need because of the facility with which they can acquire new genetic material

from congeners).

It has recently been shown that the nature of gene regulation also imposes certain

constraints on the relationship between the amounts of DNA assigned to coding (for

proteins) and those which are considered to be noncoding (i.e., corresponding to

regulatory sites such as promoters). According to what is known about the molecular

details of gene transcription (Sect. 14.8.2), to a first approximation each gene (with

an average length of about 300 base pairs) requires a promoter site (which might

have of the order of 10 base pairs). This gives 9:1 as the typical ratio of “coding” to

“noncoding” DNA in prokaryotes. 28 In the spirit of Wright’s “many to many” model

of regulation, gene regulatory networks are expected to be of the “accelerated growth”

type (see Sect. 12.2), because each new gene that is added should be regulatorily

connected to a fixed fraction German rr of the existing genes. Hence, if gg is the number of

genes, then the number of regulations (edges of the graph)r equals German r g squaredr = rg2. These regulations

are themselves mediated by proteins (the transcription factors) encoded by genes.

However, there is an upper limit to the number of interactions in which a protein can

participate, roughly fixed by the number of possible binding sites on a protein and

their variety; empirical studies 29 suggest that the upper limitk Subscript normal m normal a normal xkmax of the degreekk of the

network is about 14. Since k equals 2 r divided by gk = 2r/g, this suggests g Subscript normal m normal a normal x Baseline equals k Subscript normal m normal a normal x Baseline divided by left parenthesis 2 German r right parenthesisgmax = kmax/(2r), which would

appear to correspond to the 10 Superscript 7107 base pairs maximum genome size of prokaryotes.

As is well known, however, even allowing for possible overstatement in eukaryotic

genome length, far larger eukaryotic genomes are known to occur. Given their evident

regulatory success (as evinced by the real increase in organismal complexity), one

may suppose that the “accelerated growth” network model still holds; that is, all

of the additional proteins are properly regulatorily integrated. Ahnert et al. (2008)

have proposed that the regulatory deficit implied byg greater than g Subscript normal m normal a normal xg > gmax is met by “noncoding”

RNA-based regulation (see Sect. 14.8.4), the overhead of which is much smaller

28 Some groups of genes, typically those related functionally (such as successive enzymes in a

metabolic pathway), are organized into “operons” controlled by a single promoter site and are

therefore transcribed together.

29 Kim et al. (2006).